Extremal Real Algebraic Geometry and A- Discriminants 1 



Alicia Dickenstein* J. Maurice Rojas^ Korben Rusek* Justin Shih§ 

February 2, 2008 

For Askold Georgevich, with admiration and respect. Happy 60! 

Abstract 

We present a new, far simpler family of counter-examples to Kushnircnko's Conjecture. 
Along the way, we illustrate a computer-assisted approach to finding sparse polynomial systems 
with maximally many real roots, thus shedding light on the nature of optimal upper bounds in 
real fewnomial theory. We use a powerful recent formula for the ^4-discriminant, and give new 
bounds on the topology of certain .A-discriminant varieties. A consequence of the latter result 
is a new upper bound on the number of topological types of certain real algebraic sets defined 
by sparse polynomial equations. 

1 Introduction 

The algorithmic study of real solutions of systems of multivariate polynomial equations is central 
in science and engineering, as well as in mathematics. For instance, entire fields such as Computer 
Aided Geometric Design and Control Theory essentially revolve on basic but highly non-trivial 
questions involving certain structured polynomial systems (see, e.g., |GK03[ |RS98| ). Furthermore, 
polynomial systems whose real roots lie outside the reach of current algorithmic techniques regularly 
occur in a myriad of industrial problems, and many of these problems involve sparse polynomial 
systems, i.e., equations with "few" monomial terms. Understanding the number of real solutions 
of sparse polynomial equations is thus fundamentally important in real algebraic geometry. 

Here we shed light on the difficulty behind determining the maximal number of real roots of 
polynomial systems with a fixed number of exponent vectors. We give new, dramatically simpler 
counter-examples (in Theorem 11.1 1 below) to an earlier conjectural upper bound of Kushnirenko. A 
consequence of our investigation is a precise quantitative statement that (for many fixed choices of 
exponent vectors) sparse polynomial systems with maximally many real roots are very rare: they 
lie in extremely small chambers — determined by a suitable discriminant variety — in the space 
of coefficients (Theorem 11.21 below). Moreover, we prove an explicit upper bound on the number 
of such chambers (see the proof of Theorem 11.31 below). This in turn implies a new upper bound 
on the number of smooth topological types attainable in families of real algebraic sets defined by 
certain sparse polynomials ( Theorem 11.31 and Corollary 11.41 below). 
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The techniques of our paper actually extend to general ^.-discriminants and counting topological 
types of real zero sets of general sparse polynomials. The latter results will appear in a forthcoming 
paper. However, the special cases covered here already yield new results on extremal real algebraic 
geometry, which we now review in detail. 

1.1 Background on Extremal Estimates 

In a book published in June 1637, Rene Descartes stated that any real univariate polynomial 
with exactly m monomial terms has at most m — 1 positive roots |SL54j . Unlike the behavior 
of complex roots, Descartes' bound on the number of real roots is completely independent of the 
degree of the polynomial. However, nearly four centuries later, we still lack a definitive analogue 
for systems of multivariate polynomials. Great progress was made by Khovanskii and Sevastyanov 
|Kho80l IKho91] around the late 1970's, culminating in an explicit upper bound for the number of 
non-degenerate positive roots of general sparse polynomial systems. This bound — a very special 
case of Khovanskii's Theorem on Real Fewnomials |Kho91j - revealed that the maximal 
number of isolated real roots of polynomial systems with a fixed number of exponent vectors is 
independent of the sizes of the exponent vectors. Khovanskii's theory has since enabled important 
advances in many different areas, e.g., Hilbert's 16— Problem |Kal03| . algorithmic complexity 
[GVOll IVG03} IBRS06] . the study of torsion points on algebraic curves |CZ02| . and model theory 
[Wil99| . to name but a few (see also the conclusion and bibliography of [Kho91] ). 

Khovanskii's original bound is now known to be far from tight (see, e.g., |LRW03l IBS06] ) . More 
to the point, finding general optimal bounds is a decades-old problem whose solution would have 
significant impact outside, as well as inside, real algebraic geometry. Unfortunately, finding optimal 
bounds even for two equations in two unknowns — with just three terms each — turned out to be 
difficult enough to take close to 20 years to do. 

To clarify this difficulty, consider the following polynomial system, which we will call the Haas 
system with parameters (a, 6, d): 

_ jhi(x,y) :=x 2d + ay d -y 
a (a,b,d)- \h 2 (x,y) :=y 2d + bx d -x 
Letting M* := R \ {0} and letting M + denote the positive ray, we call the collection of systems 
{ H (a,b,d)} (a,M)e(M*) 2 xN the Haas family. 

Definition 1.1. Given any f 6 R[xi, . . . ,x n ] with f(x) = YliLi c i x<li (where the a{ are pair-wise 
distinct and all Ci are nonzero]^ we call f a (real) n-variate m-nomial. Also, given /i, . . . 
with fi an n-variate mi-nomial for all i, we call F := (/i, . . . , f^) a k x n fewnomial system of 
type (mi, . . . , to^). Finally, if the total number of distinct exponent vectors among the fi is m, 
then we can also call F an ?n-sparse k x n fewnomial system, o 

Thus, for example, any system from the Haas family can be referred to as (a) a 2 x 2 fewnomial 
system of type (3,3), (b) a 2 x 2 trinomial system, or (c) a 6-sparse 2x2 fewnomial system. Note 
also that H^ a b d ^ has the same roots in (M*) 2 as {h\{x, y)/y, h 2 (x, y)/x), which is 5-sparse. 

The aforementioned special case of Khovanskii's Theorem on Real Fewnomials (invoking an 
improvement observed by Daniel Perrucci [Per05| ) states that an m-sparse n x n fewnomial system 
never has more than (n + i) m - 1 2( m ~ 1 )( m ~ 2 )/ 2 non-degenerate roots in the positive orthant . 
This in turn implies that the maximal number of non-degenerate roots in IR^_ of any H^ ab( ^ in the 
Haas family is no more than 5184, since we can replace any H( a b d ^ by a 5-sparse system with the 
same roots in R 2 ^. (cf. the preceding paragraph). 

1 For any a £R™, the notation a := (ai, . . . , a„) and x a — x" 1 ■ ■ ■ x" n will be understood. 
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Anatoly Kushnirenko, also around the late 1970s, conjectured a significant sharpening of Kho- 
vanskii's bound: Kushnirenko's Conjecture was the statement that nx n fewnomial systems of 
type (mi, . . . , m n ) never have more than nr=i( m « — -0 non-degenerate roots in the positive orthant 
WL- This conjecture, if true, would have implied that the maximal number of non-degenerate roots 
in of any H^ a b d ^ in the Haas family is 4, thus pointing to a rather large gap. (It is a simple 
exercise to construct 2x2 trinomial systems having 4 non-degenerate roots in 

A bit of Gaussian elimination easily reveals that Kushnirenko's conjectural bound, if true, would 
have implied an elegant upper bound of (m — n) n for the number of non-degenerate roots of any 
m-sparse nx n fewnomial system |LRW03| Prop. 1] . Since m-sparse nx n fewnomial systems have 
no isolated roots in R" when m<n [LRW03, Prop. 1 and Thm. 4], the case where n is fixed and m 
is large becomes a natural question. Kushnirenko's Conjecture (or even an upper bound of the form 
0{m) n £ for some e>0) — if true — would have thus been a significant asymptotic improvement 
to Khovanskii's bound. 

Remark 1.2. Curiously, over a different metrically complete field — the p-adic rationals Q p , 
for any fixed prime p - - it is now known that the number of geometrically isolated roots in 
Qp is 0((m — n) log(m — n)) 3n , for fixed n and large m jRojO^ . Also, in the complementary 
setting of fixed m — n and large n, the number of non- degenerate roots in is now known to be 
0{n) m - n ~ l WSUBj . o 

According to Bertrand Haas (and conversations of the second author with Dima Yu. Grigor'ev 
and Askold Khovanskii, on or before September 2000), Kushnirenko saw a simple counter-example 
to his conjecture shortly after he stated it in the late 1970's. Unfortunately, no one ever recorded 
this counter-example, or the identity of its author. Fortunately, Haas proved in 2000 |Haa02j . via 
an ingenious elementary argument, that the system 

x 106 + l.ly 53 - l.ly 
y 106 + l.lx 53 - l.lx, 

along with many others with nearby real exponents, has at least 5 roots in M.^_. Shortly after, Li, 
Rojas, and Wang proved that all 2x2 trinomial systems (and, in particular, all systems in the Haas 
family) have at most 5 isolated roots in jLRW03j . The latter trio of authors also significantly 
sharpened Khovanskii's bound for certain other families of n x n sparse polynomial systems. 

Haas' example above was thus the simplest known counter-example to Kushnirenko's Conjec- 
ture, until the present paper. 

Remark 1.3. It is interesting to observe that the existence of a pair of real bivariate polynomials 
F:=(/i,/2) — with fi a trinomial and fi a tetranomial — having more than 6 isolated roots in 
R+, is still an open problem. The maximal number is known to be at least 6, and no larger than 14 
ILBWO-A Thm. 1, Assertion (a)], o 

1.2 Main Results 

1.2.1 New Counter-Examples and the Probability of Finding One 

We give a new family of counter-examples far simpler than that of Haas, and announce what appear 
to be many more such families. In particular, while Haas found a pair of trinomials of degree 106 
(and many more of higher degree) , we given an explicit cell in M 2 which is naturally identified with 
an infinite family of pairs of trinomials of degree 6. We have also found experimentally 49 other such 
cells (consisting of 2 x 2 systems of trinomial of even degree < 106) which are all counter-examples 
to Kushnirenko's Conjecture as well, but we focus here on the simplest. 
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Proposition 1.4. Let Ed CR 2 denote the set of (a, b) such that -ff( a ,&,cZ) has at least 5 non- degenerate 
isolated roots in the positive quadrant. Then Ed is open and symmetric about the line {a = b}. ■ 



Theorem 1.1. Following the notation of Proposition [L4\ £3 is non-empty (and in fact star con- 

x e + ||y 3 — y , y e + ||j; 3 — x ) has 



ves^). In particular, f|) G £3, cmd t/ius i/ie 2x2 system 
exactly 5 roots, all non- degenerate, in K 2 , . 



We point out that while our first verification of our simplest counter-example was done via Grobner 
bases (on the computer algebra system Maple), we present here a novel (and simpler) numerical 
verification via Smale's Alpha Theory |Sma861 lBC^S98] . 

Our next result reveals that for small d, it is highly unlikely that a random choice of (a, b) will 
yield H^ a h d ) as a counter-example to Kushnirenko's Conjecture. 



Theorem 1.2. Following the notation of Proposition \TJj\ E\ and E2 are empty, and < Area(Es) < 
5.701 x 10~ 7 . In particular, letting a and b be independent identically distributed standard real 
Gaussian random variables, Prob[(a, b) G £3] < 1.936 x 10~ 9 . Finally, let a be the smallest positive 
root of the univariate degree 7 polynomial C(a), which we define as 

823564528378596a 7 - 7917064766635392a 6 - 195134969401159896a 5 - 668651015982750336a 4 - 19908809569295316a 3 - 564987948607350000a 2 + 2392425241171875000a - 8620460479736328125, 
and let (3 < 7 be the two smallest positive roots of the univariate degree 36 polynomial A(a) in the 
Appendix. Then the boundary of £3 consists of exactly 4 convex arcs, meeting exactly at 4 vertices: 
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To 10 decimal places, the preceding coordinates are 
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This paucity of extremal examples can be visualized most easily by plotting the regions in the space 
of coefficients that yield H^ a b d ) with a given number of roots in M^o Below is a sequence of 4 such 
plots (for d = 3), drawn on a logarithmic scale and successively magnified up to a factor of about 
1700. 





The small red (or grey) diamond- like region on the right-most plot is £3. The connected components 
of the complement of the underlying real curve are regions on which the number of roots of H^ a ^ ^ 
in 1R^_ — as a function of (a, b) — is constant (see also the discussion after the proof of Theorem II .21 
in Section[5]below). In particular, the real curve we see above is the real part of an ^4-discriminant 
variety — a tool (reviewed in Section [2] below) underlying our final main result. 



2 Recall that a set S C R" is star convex iff there is a p £ S such that for all x £ S, the (closed) line segment 
containing p and x is contained in S as well. 

3 The fact that systems in the Haas family have no roots on the coordinate cross, other than the origin, guarantees 
constancy (if we vary the coefficients while avoiding singularities) of the number of roots in each quadrant. 
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1.2.2 New Counts for Topological Types and Discriminant Chambers 

Recall that while a smooth, real, degree d projective plane curve has at most 1 + r*~ *J connected 

components |Har76j . determining the possible nestings of these ovals — a piece of the first part of 
Hilbert's famous 16— Problem |Kal03| — is quite complicated. In more general language, this is 
the determination of possible diffeotopy types of such curves. 

Definition 1.5. Recall that a diffeotopy between two sets X,Y <ZM n is a differentiable function 
H : [0, 1] x W 1 — ► W 1 such that H{t, •) is a diffeomorphism for all t£ [0, 1], H(0, •) is the identity 
on X, and H(1,X)=Y . Equivalently, we simply say that X and Y are diffeotopic. o 

Note that diffeotopy is a more refined equivalence than diffeomorphism, since diffeotopy implies an 
entire continuous family of "infinitesimal" diffeomorphisms that deform X to Y and back again. 
Returning to nestings of ovals of real degree d projective plane curves, an asymptotic formula of e d 
is now known [OK00J, and the exact number is currently known (as of late 2006) only for d<8. 
Via our techniques here, we can study diffeotopy types in a dramatically different setting. 

Definition 1.6. Given any n-variate m-nomial f , its support (or spectrum,) — written Supp(/) 
- is its set of exponent vectors. Also, given any k x n fewnomial system F = (/i, . . . ,fk), let 
Supp(i ? ) := (Supp(/i), . . . , Supp(/fc)). Finally, we let Z + (F) (resp. Z^(F)) denote the set of roots 
of F in R% (resp. (R*) n ). o 

Given ylcZ™ with f^A = n + 3, we let Q denote its convex hull, and make a mild assumption 
that will be removed in future work (see, e.g., |BRS06l Sec. 3.2]): To avoid certain technicalities 
involving topological changes at infinity, we assume that the configuration A is generic in the sense 
that the intersection of A with each facelQ of Q consists of exactly n points. In particular, it 
is a routine exercise in polynomial/linear algebra to show that when the points of A are chosen 
uniformly randomly from [— s, s] n n Z n , the failure probability of our genericity hypothesis decays 
like 0{l/s n ) as s — ► +oo. 

Theorem 1.3. For any fixed AgH 1 with j^A = n + 3 and satisfying the genericity assumption 
above, there are no more than (10 + 6n + n 2 ) (l6 + 8n + ?i 3 + 4p • (n + 2) 2 (n + 4) 2 ) diffeotopy 
types for any smooth Z^{f) with Supp(/)=.A. In particular, the preceding bound (a) is no larger 
than 2 ^(n + 4) 6 , and (b) is completely independent of the coordinates of A. 

In contrast to the situation for degree d plane curves, it is interesting to note that in our n-variate 
(n + 3)-nomial setting, the number of possible diffeotopy types is much closer to the maximal 
number of compact connected components of Z + {f): The latter number was recently shown to be 
no greater than [BS06] . In view of Theorems 11.21 and 1 1 . 3\ it thus appears that the intricacies 

of distinguishing attainable topological types for Z + {f) — for an / with support Ac7* n — might 
begin at j^A = n + 3. 

In particular, for #A = n+l (resp. #A = n + 2), there are at most 2 (resp. 3) smooth diffeotopy 
types for Z + (f). The latter topological bounds are respectively proved in Section 3.2 and Theorem 
2 of [BRS06J, where the resulting algorithmic implications are pursued in much greater depth. We 
are unaware of any other earlier published explicit bounds — depending solely on n and #A - 
for the number of topological types of real fewnomial zero sets. That there are any such bounds 
at all is already a non-trivial fact, first observed by Lou van den Dries around the 1990s via 
o-minimality (see, e.g., |vdD981 Prop. 3.2, Pg. 150]). Thierry Zell has informed the authors that 

4 i.e., face of codimension 1 
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results in |GVZ04| (on Pfaffian functions and quantifier elimination for fewnomials) appear to imply 
an upper bound of n°( n )2°( n > for the number of corresponding smooth topological types in our 
fewnomial setting aboveH 

Our polynomial bound above is thus a great improvement. It is also likely that our bound above 
can be improved even further: the main example we explore in this paper has n = 3 and just 15 
representative chambers, even though the underlying discriminant has over 58 monomial terms. 

As a consequence of our last theorem, we get a constant upper bound for the number of smooth 
topological types of families of real algebraic surfaces defined by "honest" hexanomials. 

Corollary 1.4. Fix Adl? ', satisfying the assumption preceding Theorem \1.3\ and with cardinality 
6. Consider the family of polynomials g with support A, which we will identify with (M*) 6 . Then 
the number of smooth diffeotopy types of any such Z^{g) is no more than 237920. ■ 

Theorems 11.31 11.11 and 11.21 are respectively proved in Sections [3l 01 and [5l The main underlying tools 
are an important recent parametric formula for certain ^-discriminant varieties |DFS05|. Prop. 4.1], and 
recent advances in quantitative estimates for sheared binomial systems (cf. Section [2]) LRW03, BS06J. 

2 Background on *4-Discriminants 

The standard reference for ^.-discriminants is [GKZ94| . For our purposes, we will modify a few 
notions, present motivating examples, and quote some more recent results as well. But first, let us 
recall the following notation (see |Loe91j or [GKZ94, Ch. 1, 9-11] for further background). 

Definition 2.1. Given any A = {ai, . . . , a m }cZ n of cardinality m, we let X4 — the (projective) 
toric variety associated to A be the closure of the parametrized subvariety 

{[t ai : ■■■ : t am ] I t = (h, ...,t n )e (C*)™} o/P™ -1 . Finally, we define V A — the ^-discriminant 
variety — to be the closure of the set of all [ci : • • • : c m ] € P™ _1 such that the hyperplane 
{c\Z\ + • • • + c m z m = 0} intersects a regular point of with a tangency. o 

V.4 also happens to be the closure of those [ci : ■ ■ ■ : c m ] eP™ _1 such that the complex zero set of 
SI^i CiX ai possesses a singularity in (C*) n [GKZ941 Prop. 1.1]. 

Example 2.2. It is a simple exercise to verify that V^i^} the set of all [a : b : c] SP^ such that 
the quadratic polynomial f(x) :=a + bx + ex 2 has degree < 1 or a double root in C. In particular, 
V {0,1,2} i s the projectivized complex zero set of b 2 — Aac, and one can check that the affine real zero 
set of b 2 — 4ac in R 3 is a double cone. Topologically, Pj| Pi V{o,i,2} ^ s thus a circle, o 

In particular, V.4 is an irreducible algebraic variety defined over Z (see, e.g., |GKZ94|, Prop. 1.3, 
Pg. 15]), and this motivates the following important definition. 

Definition 2.3. When codimV„4 = l ; we define (up to sign) the .A-discriminant, A^GZ[ci, . . . ,c m ], 
to be the (irreducible) defining polynomial 0/V4. Otherwise, we set A^:=l. o 

Example 2.4. Continuing our last example, we can take A{ 0i i i 2} = & 2 — 4ac. In particular, observe 
that for real (a, 6, c), f has exactly 0, 1, or 2 roots according as A{ , 1,2} ( a ; b, c) is negative, zero, or 
positive. Note also that the classical quadratic formula tells us that the roots of f are differentiate 
functions of the coefficients (a,b,c), provided [a : b : c] V^i^}- 

5 As this paper was being finalized, Saugata Basu informed the authors that a similar (exponential) bound is 
likely also provable via |B V06| . but with the advantage that the support can be varied when n and #A are fixed. 
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Example 2.5. For A := {(0, 0), (1, 0), (2, 0), (0, 1)}, it is easily checked that is exactly 
{[a : b : c : d] \ b 2 — Aac = d = 0} and thus A_4 = l. o 

^.-discriminants are central in computational algebraic geometry, containing all known mul- 
tivariate resultants as special cases (see, e.g., |GKZ94( Prop. 1.7, pg. 274]). More to the point, 
they are notoriously difficult to compute: (1) detecting just their vanishing is NP-hard already for 
AcZ 2 [Pla841lKoi97] . and (2) ^-discriminants can have many monomial terms, already for _4cZ 
and #^4 = 4 [RY051 Sec. 1.2] (see also Example 12.91 of the next section). 

Relevant to our applications, the fact that the vanishing of A_4 determines when certain hyper- 
surfaces possess singularities readily implies that the real complement of V_4 can be used to encode 
the number of real roots of certain polynomial systems. In particular, let us call any connected 
component of P™" 4-1 \ V.4 a(n) (A-) discriminant chamber. Lemma 12.221 of Section 1 2.21 below 
(see also [GKZ94. Ch. 11, Sec. 5]) relates the number of ^.-discriminant chambers to the number 
of smooth topological types attainable by the real zero sets of certain families of sparse polynomial 
systems. Understanding discriminant chambers is thus a feasible route toward understanding the 
maximal number of real roots of F. 

Remark 2.6. When jfA£ {n + 1, n + 2}, A- discriminant chambers turn out to have a very simple 
structure: just one (resp. at most two) chamber (s) for jfA = n + 1 (resp. f^A = n + 2) \GKZ9J\ 
Prop. 1.8, Pg. 274]- Hence our focus on #A = n + 3 throughout this paper, o 



2.1 Studying .4-Discriminant Chambers 



In what follows, we will always assume that A is a subset of Z n , of cardinality m, that affinety 
generates Z n . We will also frequently assume that OGi; in which case A affinely generating 
Z n is equivalent to Z n being generated by the set of all integral linear combinations of the remaining 
vectors in A. In general, these assumptions can easily be enforced by merely translating A and, if 
necessary, applying the Hermite factorization [Sto98| for integral matrices, „4-discriminants happen 
to be invariant (modulo a permutation of coordinates) under affine transformations of A that 
are injective and integral |GKZ94l Prop. 1.4, Pg. 272]. For now, let us observe a simple example. 

Example 2.7. Note that while ^4:={3, 30, 57} does not affinely generate Z or contain the origin, 
this A is a shifted multiple of a set that does both: {0,1,2}. In particular, the polynomial 



ax"' + bx M + cx 4 has a degenerate root in C* iff the polynomial ax + bx + c has a degenerate root 



r;57 _|_ _|_ h„n n Annnnnvnl,, inn (P * n ft +h n r. rjl 

in C*. Thus, we clearly have V{ 3) 3 ,57} = V{ 0l i,2} and A { 3 i30)57 } = A {0)1)2 }. o 
Remark 2.8. We will frequently abuse notation by also identifying A with the nx m matrix 



where ai 



"ii, 



for all i. Also, given vectors Vi, . . . , v m G W 1 , we will let [v\, . . . , v m ] denote the 



nx m matrix whose j— column is Vj. Finally, we will use the notation ^faii— >"»•] •—(z Vl , . . . ,z Vm ). o 

The combinatorics of exponents will thus be particularly important throughout our 
development. Continuing this theme, we will define notions useful for simplifying discriminant 
chambers. However, before going into further definitions, let us first motivate the need for simpli- 
fication via a more intricate ^.-discriminant example. 



Example 2.9. Suppose A := 



6 
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The real part of is at 



the heart of Theorem and it is interesting to observe that A_4 happens to be rather large: 
A_4(l, a, —1, 1, b, —1) is (up to sign)... 

1102507499354148695951786433413508348166942596435546875-516440160351044111358464119738658142157348733522052 a 35 
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+ 54 additional monomial terms of comparable size 
-24519711093887016527058411574716512472434688 ft 46 a 39 + 82754024941868680778822139064668229594467072 f> 47 a 33 . 
In particular, A_4 (l, |y, —1, 1, |y, — l) 7^0, and (via the proof of Lemma \2.22\ below) this is equiv- 
alent to the fact that #(44/31,44/31,3) has no degenerate roots. In any event, it should be clear that 
we need a more efficient means of addressing V^. 

A beautiful recent (re) discovery is the fact that while A_4 can be unwieldy, always admits a 
compactly expressible parametrization: the Horn-Kapranov Uniformization (see also |Kap91 
lPT05| ). 



Theorem 2.10. [DFS05, Prop. 4-1] Given A:= {a±, ... ,a m } G Z n , the discriminant locus V.4 is 
exactly the closure of 



{[ Ul t ai 



LL m L 



u:- 



■.( Ul ,...,u m )€C m , Au = Q, TZi u i=°i t = (h,...,t n )e(C*) n }. 



Example 2.11. Continuing Example \2.9l it is easily checked that the set of vectors {(u\, . . . ,uq)} 
needed to form the parametrization from Theorem \2.10\ is a vector space with basis 

{(-2, 35, -33, -12, 0, 12), (-2, 11, -9, -4, 4, 0)}. 
We thus obtain that, in spite of the huge formula for A_4(l, a, —1, 1, b, —1) we saw earlier, V^CPr 



is exactly the closure of 

{ [-(2A + 2)4 : (35A + : -(33A + 9)t s 



-(12X + A)ti4 :4ti<i : 12Aiit 2 ] I AgC, (h, t 2 , t 3 ) G (C*) 3 }. o 



Computationally, however, we will need to express V_4 in an even more efficient manner. In 
particular, if we are studying the topology of the zero set of a polynomial as we vary its coefficients, 
we should certainly take advantage of the various homogeneities that preserve the topology of the 
underlying zero set. 



Example 2.12. Returning to Examples \2.9\ and \2.l![ note that the set of exponent vectors of 
g(x, y, v) := c\x 6 + c 2 y 3 + c%y + v{c^y % + C5X 3 + c^x) is exactly A, and that the topology of Z^(g) 
is preserved under nonzero scalings of the coefficient vector (ci, . . . , c%) G (M*) 6 , and nonzero scal- 
ings of the variables v,y,z. In particular, if we would like to find (a,/3, 7, 6) G (ffi*) 4 such that 
5g(ax, (3y, jv) = x 6 + ay 3 — y + v(y G + bx 3 — x) for some real a and b, then we must clearly solve 
the binomial system 



6 O O 1 
O 1 6 O 
OOll 
1111 



(a, (3, 7, 5) L' 1 1 ^ = ( 

c l 1 j — c 3 c 4 X ' — c 6 X )- multiplying and dividing equations (mimicking 
Gaussian elimination), one can then derive that 



Cl 



C4 — cc 

Cl ' Cl 



L°J and b=^- 

Cl 



-e -6 —s 



— C3 £4 — C(j 
Cl ' Cl ' Cl 

"6 1" 
16 
11 
1111 

and differ only in sign. So one can check by hand (or via Lemma \2.13\ and Proposition 2.17\ below) 



Note in particular that the determinants of the matrices 





'-6 -6 -5" 


and 


1 6 




1 1 



are both odd 



that one can indeed always find such real (a,b). More to the point, we have reduced the study of 
Z^(g) from 6 to 2 parameters, o 

Lemma 2.13. Suppose A = {a±, . . . , a m } C IT 1 affinely generates Z n and a\ = O. Then there are 
i\, . . . , i n G {2, . . . , m} such that detfa^ , . . . , a^J is odd. ■ 

Definition 2.14. Suppose A = {a±, . . . ,a m }cZ n affinely generates Z" has cardinality m>n + 2, 
anda\ = 0. We call any set C = {i±, . . . ,i n } with detfa^, . . . , a,i n ] odd as in Lemma \2.13\ above, an 
odd cell of A. For any nx m matrix B, we then let Be (resp. Be) denote the submatrix of B 



S 



defined by columns of B with index in C (resp. {2, . . . , m} \C). For any vectors v,w£ (C*) m , let 
us denote their coordinate-wise product by v -w:= (viW\, . . . , v m w m ). Also let T be the multivalued 

function from (C*) m to (C*) m ~ n ' 1 defined by F(y) := • (ijjjA ° °' . Finally, we define the 

reduced A- discriminant variety, V^cC m_n_1 , to be the closure of 

|T(u) | u:=(ui, . . . ,u m )e(C*y™, Au=0, ££^ = 0}, 
and call any connected component of (M*) m ~ n ~ l \ V_4 a reduced (A-) discriminant chamber, o 

Remark 2.15. Since we always implicitly assume that an odd cell has been fixed a priori for our 
reduced A- discriminant varieties, T in fact restricts to a single-valued function from (K*) m to (R*) TO ~ n . o 



(The underlying A- discriminants are left 



Example 2.16. Continuing Examples \2.9[ \2.11\ and \2.12\ let us shift our original A slightly to 

"0 —6 —6 —6 —3 — 5~ 

instead work with .4=03 l 6 o o 

1 1 1 

unchanged thanks to affine invariance \GKZ94 . Prop. 1.4, Pg. 272].) As observed in Examvle \2.12\ 
C = {3, 4, 6} is an odd cell for A. So Ac is then the 3x3 matrix from Example \2.11\ Ac' = 



-6 —3 
3 
o 1 



, A c }Ac> 



33/35 —12/35 
12/35 2/35 
-12/35 33/35 



, and thus T{u) 



12/35 12/35 

2/35 33/35 12/35 J 12/35 2/35 33/35 



Defining £\, ... ,£§ respectively as the polynomials — 2A — 2, 35A + 11, — 33A — 9, — 12A — 4, 4, 12A, 
and letting ^f(X) := (ipi( A), ^(A)) :=r(^i(A), . . . ,£q(X)), we thus obtain that V.4 is the closure of 
(*(A) | AGC, 4(A)---4(A)t^0}. o 

Let us now consider how the topology of Z^(f) changes as / ranges through ^.-discriminant 
chambers. First, note that it is easy to show that for any ZC(R*) n and any coordinate reflection 
a : W 1 — >M. n , Z and o~(Z) are not only diffeomorphic but diffeotopic. With just a little more work, 
one can then show the following: 

Proposition 2.17. Suppose A={a±, . . . ,a m }cZ"' affinely generates Z n , has cardinality m>n + 2, 
andai = 0. Also let C be any odd cell of A, let f(x):=Y^ 1 S i x ai with 5 := (6±, . . . , S m ) G (R*) m , 
and let 6 G (M*) m be the unique vector with S\ = 1, 6c = (1, • • • , 1) and 5c = r(5). Finally, let 
f-=YliLi 5iX ai and let Conv^4 denote the convex hull of A. Then: 

1. r induces a surjection from the set of connected components of 

P™- 1 \(V A U{[y 1 :---:y m ]eF™- 1 \ y 1 --- Vm = 0}) 
to the set of reduced A- discriminant chambers. 

2. If, for all facets Q' of Conv^4, we have that jf(Ar\ Q') = n, then Z^(f) and Z^(f) are 
diffeotopic. Furthermore, for any f\ and fi with f\ and fi lying in the same reduced A- 
discriminant chamber, Z™(fi) and Z^f?) are diffeotopic. ■ 

Proposition [27T7] follows easily from a routine application of the Smith normal form and the implicit 
function theorem. In particular, the crucial trick is to observe that exponentiation by Ac, when C 
is an odd cell, induces an automorphism of orthants of (M*)' 1 . Our assumption on the intersection 
of A with the facets of Conv.A ensures that any topological change in the zero sets of / and / (in 
the underlying real toric variety corresponding to Conv^l |Ful9 3j ) occurs within (M*) n . 

Remark 2.18. It is also easily checked that our genericity assumption (on the intersection of A 
with the facets of ConvA) implies that (a) the nullspace of A is not contained in any coordinate 
hyperplane, and (b) when ffA = n + 3, one has that V^t n M 2 is not contained in any line, o 



6 The multiple values arise from the presence of rational exponents, and the number of images of a point is always 
bounded above by a constant depending only on A. 
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Definition 2.19. Following the notation of Definition \2A4\ given any 5 G (C*) m , let 5' be the 
unique vector with 5[:=1, 5' c = l, and 5' c , = 8c>- We then define the reduced „4-discriminant to 
beA A (Sc)--=A A (5'). o 

2.2 Going Beyond One Polynomial Via the Cayley Trick 

We will need one last construction in order to apply ^4-discriminants to systems of equations. 

Definition 2.20. Let ej denote the i— standard basis vector of M.°° . Then, for any A\, ... ,^Cl n , 
we callC&y(Ai, . . . , Ak) ■= (A\ x {0})U(.A2 x {e n+ i})U- • -U(.AfcX{e n+ fc_i}) the Cayley embedding 
of (A\, ■ ■ ■ ,Ak)- We also define the Newton polytope of f to be Newt(/) :=Conv(Supp(/)) and, 
for any compact set B C M. n and w = (wi, . . . , w n ) G M. n , we define B w — the face of B with inner 
normal w — to be {(xi, . . . , x n ) G B \ x\Wi + • • • + x n w n is minimized}, o 

Example 2.21. Taking (Ai,A2) to be the support of any H^j, ^ in the Haas family, observe that 
Cay (.4.1, ^2) is exactly the A from Examples \2. 9\\2^TR o 

Lemma 2.22. (See, e.g., \GKZ94\ Prop. 1.7, Ch. 9 and Pg. 380].) Suppose /1, fi G M.[x, y] are 
bivariate polynomials with respective supports A\ and A2, and let i f7 :=(/i, /2)- Assume also that 
for all io£R 2 the pair of faces (Conv^i)™, 00^(^42)^) never consists of two parallel edges. Then 
- identifying the coefficient of x a in fi with the point (a,0) or (a, 1) of Cay(^4i, ^2) according as 
i is 1 or 2 — the number of isolated real roots of F := (/1, f2) in any open quadrant is constant 
on any discriminant chamber of Cay (^.1,^.2 )• 

Proof: Fixing an ordering on A\ and A2, let denote the coefficient vector of any fi with 
support Ai. Also let V ( _ 4li _ A2 ) denote the closure in p#- A i+#- 4 2-i of those |" c (i) . c (2)j such that the 

corresponding F :={f\, f2) has a degenerate root in (C*) 2 . Via [GKZ941 Prop. 1.7, Ch. 9 and Pg. 
38 °]> v (^i,^ 2 ) is exactly V CajMl]A) . 

Note also that by our assumption on the (Conv(^4i) w , Conv(^42)^) (using w G {ei, 62}), the only 
possible isolated root of F on the coordinate cross is (0,0). To conclude, we need only to observe 
that if [S] eP*" 4 '" 1 \ V.A and C W has no zero coordinates for all i G {1, 2}, then all the roots of 
F in the toric variety corresponding to Conv(^4i) + Conv(^42) he in (C*) 2 . Thus, along any fixed 
path within any fixed Cay(_4i, ^,2)-discriminant chamber, the roots of F are continuous functions 
(with bounded range) of the coefficients. ■ 

3 The Proof of Theorem [L3] 

The reduced ^.-discriminant variety has many interesting properties that we will exploit. Before 
proving Theorem 1 1.31 however, we will need an important recent bound on the number of real roots 
of certain structured polynomial systems. 

Definition 3.1. Suppose £±, . . . ,£j GM[Ai, . . . , are polynomials of degree < 1. We then call any 

system of equations of the form S := (l - l\{ =1 ^''{Xi, ■ ■ ■ , A fc ), . . . , 1 - ~[\{ =1 ^(Xi, A fc )) , 

with bi^i G IR for all i, i' , and the vectors (i>i,i, ■ ■ ■ , bi,j), ■ ■ ■ , (&fc,i, • • • , frfcj) linearly independent, a 
k x k sheared binomial system with j factors. We also call each li a factor of the system. 
A sheared binomial system is referred to as a Gale Dual System in IBSOb^ .o 

Note that our definition implies that j>k. For j = k, it is easy to reduce any k x k sheared binomial 
system with j factors to a k x k linear system, simply by multiplying and dividing equations 
(mimicking Gaussian elimination). For j > k, sheared polynomial systems become much more 
complicated. 
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Theorem 3.2. The number of non- degenerate roo ts A 6 M fc o/ any k x k sheared binomial system 
with n + k factors, and all factors positive, is bounded above by: 

1. \LRW03[ Lemma 2] n + 1 (and the same bound applies if we also count degenerate isolated roots 
with all factors positive), for k = l, 

2. mSOfi] (e 2 +3)2( fc - 4 )( fc+1 )/ 2 n fc ; for allk>\. 
In particular, e 2 + 3^10.38905610. ■ 



We will also need one last important result before our main proof. Let \1/(A) = r(^i(A), . . . ,£ m (X)) 
be the dense parametrization of a reduced ^4-discriminant variety associated to an odd cell of A as 
in Definition 12.141 We can in fact consider this map to be defined over P£. by instead working with 
the homogenizations ■ A2]) :=a^Ai + (3iX 2 - Also, recalling Definition 12.191 and the fact that 

A_4 is an irreducible defining polynomial for with integer coefficients, let Z be the finite set of 
points of V^4 at which the gradient of A A vanishes. 

Lemma 3.3. With the notation above, i/ \&([Ai, A2]) G (R*) 2 n (V^. \ Z) then [Ai : A 2 ] can be 
chosen in P^. In other words, (R 2 n V^) \ ^(Pr) is finite. 

Proof of Lemma 13. 3t The proof is an easy consequence of the fact that ^ defines a multivalued 
function (univalued from the real points, as we have already noted in Remark I2.15|) which is an 
inverse to the logarithmic Gauss map G : (C*) 2 n {SI A \ Z) — > P^ : 



G{y) :- 



yi^-A A (y) : y 2 ^-A A (y) 
dyi dy 2 



This is proved as in [CD061lPT04| , Now, since G has rational coefficients, G(y) has real coordinates 
for each real point y £ (R*) 2 n {S7 A \ Z). ■ 

We are now ready to prove Theorem 11.31 

Proof of Theorem II. 3t Let T A denote the toric variety corresponding to the convex hull of A 
[Ful93j . Note that by our assumption that every facet of A corresponds to the vertices of a simplex, 
the complex zero set of any / with Supp(/) =^4 is thus always nonsingular at infinity, relative to T4 
(see, e.g., |BRS061 Sec. 3.2]). By Proposition 12.171 h then suffices to show that our desired bound 
applies to the number of reduced A-discriminant chambers. Note also that by Proposition 12.171 
and Lemma 13.31 the real part of the reduced ^.-discriminant variety — M. 2 n V.4 — must be the 
union of a finite set of points and the closure of {\£(A) | A G R, £i(\) ■ ■ ■ £ n+ 3(\) 7^ 0}, where 

*(A) := (Vi (A), ^(A)) := (nr=i 3 ( A )> nr=i 3 4 2 '' W) > and *1, . . . , 4+ 3 are univariate polynomials 



in A of degree < 1 Let Q C M. 2 denote the aforementioned closure. Since isolated points do not 
disconnect connected components of the complement of a (locally closed) real algebraic curve, 
it thus suffices to focus on f2. In particular, the connected components of (IR*) 2 \ $7 are (up to 
the deletion of finitely many points) exactly the reduced ^4-discriminant chambers. Note also, by 
observing the poles of the ipi, that f2 is the closure of the union of no more than n + 4 arcs, i.e., 
homeomorphic images of the open interval (0, 1). 

To count the number of connected components of (IR*) 2 \ 17, we will use the classical critical 
points method |CG84j . combined with our more recent tools. In particular, let us first bound the 
number of x-axis intersections, cusps, vertical tangents and vertical asymptotes, and nodes of ft 
in M 2 . (These constitute all possible critical points of the orthogonal projection mapping O to the 
first coordinate axis.) Let us call the numbers of these respective objects Mq(A), Mi(A), M 2 {A)^ 
and Ms(A), and proceed with bounding their number from above. 
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x-axis Intersections: Clearly, $7 intersects the x-axis iff ip2 (A) = 0, and the latter occurs iff a 
monomial in the ii vanishes at some A G C U {±00}. Also, by Theorem 12.101 and Definition 12.141 
the degrees of the numerator and denominator of ^2 are equal. So there are clearly no more than 
re + 2 solutions to 1^2 (A) =0, and thus Mq(A) < n + 2. 



Cusps (and certain isolated real points): To count cusps, it suffices to bound from above the num- 



ber of complex A such that = = 0. (Note also that for such a A, \£(A) could also be an 
isolated real point of the real part of V_4 off of 17.) In particular, via the product rule, and by 
dividing out by suitable monomials in ^i(A), . . . ,£ n+ 3(A), the preceding equation reduces to a uni- 
variate system of the form Y17=i ^(A)^ = Y17=i ~1^7Xp = ^- We can then multiply through by 
£i(X) ■ ■ ■ £ n +3(A) to obtain a univariate polynomial of degree re + 2. Furthermore, it is easily checked 
that the maximal number of distinct cusps as A — > ±00 is one, and thus M\(A) <n + 3. 



I Vertical Tangents (and vertical asymptotes) : Here, we proceed essentially the same as for cusps, 
but with only one derivative. However, AG {±00} can possibly yield two distinct vertical tangents. 
So M2(A) <re + 4. Note also that this count includes all cusps. 



Nodes: Here, we need to bound the number of 2-sets {A, A'} (so A 7^ A') with ^(A) = ^(A') 
and A,A'gRU {—00, +00}. Counting these real pairs then reduces to counting the number of real 
solutions of a 2 x 2 sheared binomial system — with < 2(n + 3) = 2n + 6 factors — where no factor is 
zero. Theorem 13 . 21 counts such solution satisfying certain sign condition for the ii, so let us carefully 
count the number of (nonzero) sign combinations possible for the vectors (£i(A), . . . ,^ n+ 3(A)) and 
(^i(A'), . . . ,£ n+ 3(A')): Clearly, each such a vector admits at most (re + 4) possible (nonzero) sign 
combinations, since the sign of any ii is constant to the right (or to the left) of its unique real 
root, and there are no more than re + 3 real roots for our ii. Thus, there are at most (n + 4) 2 
possibilities for the (nonzero) sign vector of (^i(A), . . . ,£ n+ 3(A),£i(A), . . . ,i n+3 {\)). So, combining 
with Theorem 13. 21 and noting that there are < (2n + 4) + 2 factors, we thus clearly obtain no more 
than (n + 4) 2 • (e 2 + 3) • 2" 2 • (2n + 4) 2 pairs (A, A') G R 2 with #(A) = ^(A') and A / A'. Note also that 
there are infinitely many solutions of ^(A) = ^(A') of the form A = A', but these are non-isolated 
and thus not counted by Theorem 13.21 So there are no more than e (re + 2) 2 (n + 4) 2 nodes 
arising from 2-sets in R 2 , since our underlying sheared system is symmetric. 

Now, should a AgM yield ^(A) = ^(±00) as a node (for some fixed choice of sign), then ^(A') 
must clearly have a well-defined nonzero limit as A' — > ±00, since we have already counted x-axis 
intersections and vertical tangents. We are thus reduced to counting the number of real roots of a 
univariate sheared binomial, with no factor zero. By Theorem 13.21 again (with <(n + 2) + l factors), 
and recalling our last observation on the sign vector of (£1, . . . ,£ n+ 3), we then directly obtain no 
more than (n + 4) (re + 2) nodes arising from (A, A') G (M U {±co}) 2 \ R 2 . (It is also easily checked 
that should (±00, Too) yield a node, then there is another pair (A, A') Si 2 giving the same node.) 

In summary, we thus obtain no more than 6 (re + 2) 2 (re + 4) 2 + (n + 2) (re + 4) nodes, and thus 

M 3 (A) < (n + 2)(n + 4) [l + ^ • (re + 2)(n + 4) 



Back to Reduced Chambers... To count the number of connected components of (R*) 2 \ Q, let us 
now introduce vertical lines Li, . . . ,Ln exactly at the locations of the y-axis, the x-axis intersec- 
tions, the cusps, the vertical tangents, and the nodes of O. Clearly, any connected component of 

T:=(R*) 2 \(QUL 1 U---UL N ) 
is contained in a unique connected component of (M*) 2 \Q. So it suffices to count the connected 
components of T. To do the latter, observe that N < 1 + Mq [A) + M2 {A) + M3 (^4) (since cusps were 
already counted among vertical tangents via our technique above), and that our lines {Li} thus di- 
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vide (M*) 2 into no more than l+N = l + l + (n+2) + (n+4) + (n+2)(re+4) 1 + ^±3 . ( n + 2)(n + 4) 

= 16 + 8n + n 2 + ■ (n + 2) 2 (n + 4) 2 vertical strips. 

Now note that within the interior of each strip, is smooth, does not intersect the x-axis, 
and has no vertical tangents. So to count components of T within any particular vertical strip, we 
need only bound from above the number of non-degenerate intersections of £1 U {x2 = 0} with a 
vertical line distinct from L\, . . . , Ljv- This clearly reduces to counting the number of real roots of a 
binomial in n + 3 univariate linear forms. Via our earlier sign condition count, and by Theorem 13.21 
once again (with < (n + 2) + 1 factors), the desired upper bound is then 1 + (n + 4)(n + 2). Thus, 
each of our vertical strips contains no more than 2 + (re + 2) (re + 4) connected components of T. 
Taking into account the number of vertical strips, we thus finally arrive at an upper bound of 

(10 + 6re + re 2 ) (l6 + 8re + re 2 + ^ • (n + 2) 2 (re + 4) 2 ) 
for the number of connected components of T, our bound is proved. 

To conclude, note that Assertion (b) follows immediately from our bound, and Assertion (a) 
follows from merely comparing coefficients in the underlying polynomials. ■ 

Remark 3.4. The diffeotopies we have used above are thus obtained by (i) following a path in 
a reduced A- discriminant chamber, and (ii) performing a coordinate reflection. One could cer- 
tainly allow a broader class of diffeotopies, and thus (potentially) greatly reduce the bound we have 
just proved. Diffeotopies obtained solely via (i) are analogues of what are sometimes called rigid 
diffeotopies in other settings (see, e.g., JOKOO^ ). o 



(as in Example \2.16\) . we will see i 



in 



Remark 3.5. Isolated points can in fact occur in the real part of a reduced A- discriminant 

"O —6 —6 —6 —3 — ! 

variety. For instance, taking A= o 3 l 6 o o 

OOOlll 

the next section (and the Appendix) that M 2 nV„4 is the disjoint union of a connected finite union of 

smooth arcs and exactly 3 isolated points (located respectively in the H — , , and — + quadrants). 

This in turn implies that the real part of the (non-reduced) discriminant variety V.4 has connected 
components of codimension at least 2, even though is itself codimension 1 and irreducible over C. o 



4 The Proof of Theorem 11.11 

Before going into our main proof, let us first review an important criterion for an "approximate" 
root to converge quickly under Newton iteration to a true root of a polynomial system. 

Definition 4.1. \Sm a86. BCSS9^ Given any analytic function F : C n — ► C n , we let F' denote 
its Jacobian matrix, and define the Newton endomorphism, Np : C n — > C n to be the function 
Nf(z):=z — F'{z)~ 1 F{z). Also, given any zo£C n , we define the sequence o/Newton iterates of 
zq (under F) to be (z n ) ne jqij^ where z n +i :=NF{z n ) for all n>Q. Finally, given any multi-linear 

operator C : (C N ) k — ► and v = {v\ . . . , vn) S C , we let \v\ := yjuip + • • • + |^tv| 2 denote the 

I civ 1 ,. ..,v k )\ 

usual Hermitian norm and let \C\ be the multi-linear operator norm max 1 , , r~rn ■ o 

(v\...,vk)e(c N \o)k H-KI 

Definition 4.2. [Sm,a86. BCSS98] Following the notation of ' Definition ^. 1\ we define the invariants 

(3(F,z) := \z - N F (z)\ = \F'(z)-'F(z)\, 7 (F,z) :=sup fc > 2 \^F' (z)- 1 F( k \z)\ 1/{k ~ 1] , and a(F, z) := 
P(F, z)j(F, z). Also, let us call a point zqGC 1 an approximate root of F iff the Newton iterates 

of zq under F satisfy |£ — z n \ < (tj) |C — z o\ f or Q-H n>l, for some true (and non- degenerate) 
root C£<C n of F. o 
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Remark 4.3. Note that F' (z)~ l F^ is a symmetric k-linear operator from (C n ) k to C n , and that 
the underlying coefficients can be identified with k— order partial derivatives of the /, constituting 
F. In particular, observe that when F is a polynomial system, the supremum in the definition of 
■~f(F,zo) is in fact a maximum over a finite set with cardinality depending on the degrees of the fi. o 

Note that approximate roots (as defined above) give an efficient, rigorous, and numerically 
feasible way to encode true roots: For instance, instead of specifying n (likely huge) minimal 
polynomials for an algebraic point (Ci?--->Cn); an< i n corresponding isolating intervals, we can 
instead simply give an n-tuple (zq , . . . , Zq ) that is an approximate root. One can then extract 
arbitrarily high accuracy through a small number of Newton iteration, thanks to Definition 14.21 

There has been much important work on proving useful complexity bounds for this approach 
(see, e.g., [CSMP03]). In the space here, we merely point to |BCSS98j as an excellent beginning 
reference. In particular, the a invariant gives a sufficient criterion to guarantee that a given point, 
and any point sufficiently near, is an approximate root. 

Theorem 4.4. t r BCSS98\ Ch. 8] Following the notation above, suppose zoEC n satisfies a(F,zo)< 
0.03. Then zq — and any point within distance 7 (j^ ) — is an approximate root of F. Furthermore, 
the unique root ( of F to which the Newton iterates of zq converge satisfies \zq — (\ <2f3(F, zq). ■ 

The preceding result is sometimes referred to as a (robust) one point estimate, and considerably 
strengthens earlier seminal results of Kantorovich, from the 1960s, which relied on invariants defined 
over entire regions. We are now ready to prove Theorem ll.il 

Proof of Theorem ll.lt Let H(3) denote the Cayley embedding of the support of any #( a ,6,3) m 
the Haas family. Proposition 12.171 and Lemma 12.221 then tell us that we can find E% by using the 
critical points method jCG84| . just as in the proof of Theorem 11.31 In particular, for the setting at 
hand, this reduces to the following four steps: 

1. computing R(a, b) := A W ( 3 )(— a, —b), i.e., A W ( 3 )(1, a, —1, 1, b, —1) up to sign 

2. computing the partial derivatives of R up to order 2 (needed later to check vertical tangents 
and convexity of the underlying arcs) 

3. isolating those real ao such that there is a real &o with R(ao, &o) — ^jf |( ao ^ = 

4. computing the number of roots of -ff( a ,6,3) i n each quadrant, at representative choices of (a, b), 
picking at least one representative pair from each reduced discriminant chamber 

These computations are routine, albeit hours-long, via Maple. So we now summarize the crucial 
details. 



Steps (1) and (2): We calculate R by first computing the resultant p (resp. q) of h\ (resp. fo) and 
the determinant of the Jacobian of (h\, hi), with respect to the variable y, where (hi, h^) :=Hr a b g\. 

We then compute the resultant, i?GZ[a, b], of p and q with respect to the variable x. R is a multiple 
of our desired R, and we can isolate R by factoring R and picking out the factor of the correct 



degree. In particular, a quick volume calculation (see, e.g., |Roj03 Thm. 4.2.4]) shows that the 



degree of R is bounded above by 236, and there is only one such factor of R. 

It then turns out that R is symmetric, with total degree 90, degree 47 with respect to each 
variable, and exactly 58 monomial terms (each having a coefficient with between 43 and 56 decimal 
digits). In particular, up to sign, R(a, b) is the polynomial from Example 12 .91 and the full monomial 



term expansion of R(a, b) can be downloaded from |http: //www .math, tamu. edu/~rojas/haas3disc 
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Remark 4.5. It is also interesting to observe that R{a,b) is of the form r(a 6 6 1 ,6 6 a 
is a polynomial with total degree 18 and degree 9 with respect to each variable, o 



l ), wh 



ere r 



Step (3): Computing those real qq where both R(ao,bo) and ^\^ aabo ^ vanish can be done by 



computing the resultant of (it!, to eliminate the variable b. The resulting eliminant X(a) 
is large, occupying a text file over 3Mb in size, but has just 4 non-monomial factors — A*, B*, 
C*, and D*, of respective degrees 1260, 70, 35, and 5 — that have real roots. These factors are 
respectively of the form A(a 35 ), B(a S5 ), C(a 5 ), and D(a 5 ), for certain polynomials A, B, C, and D 



of significantly lower degree: D is exactly the minimal polynomial for the rational number 



16807 
2916 ' 



C 



is the degree 7 minimal polynomial for a, and A is the minimal polynomial for the real algebraic 
numbers (5 and 7 (as mentioned in the statement of Theorem 1 1.2j) . B(a) is the following quadratic 
polynomial: 

The critical values are thus the real roots of A* , B*, C*, D*, and there are exactly 16 of them: 
2 of which are negative and 14 of which are positive. With a bit more work, it is then easily verified 
that the — + and +— quadrants of K 2 each contain exactly 2 reduced chambers (each unbounded), 

and the entire quadrant is itself a reduced chamber. Thus, all the action occurs in the ++ 

quadrant, and we will need to consider a total of 16 + 2 + 1 vertical strips in our application of 
the critical points method to our real discriminant curve D:=R 2 nV H(3) (t wo extra vertical lines 
coming from the 6-axis and the unique a-axis intersection of Q). The exact location of our vertical 
strips is detailed in the Appendix. 

There turn out to be exactly 15 reduced chambers in M^_: exactly 5 that are unbounded and 10 
that are bounded. An attractive illustration of the unbounded reduced chambers can be obtained 
by computing the logs of the absolute values of the coordinates of the zero set of R in (C*) 2 , i.e., 
the Archimedean amoeba of R |GKZ94t Cor. 1.8]. In particular, the 5 unbounded reduced 
chambers (and the sole bounded chamber adjacent the origin, stretched to unboundedness — by 
the logarithm — in the illustration below), each contain exactly one of the white convex regions 
below. The remaining bounded chambers thus have images lying inside the shaded amoeba, and 



their boundaries are indicated by the darker curve: the image of ] 
value map. 



nV^( 3 ) under the log of absolute 
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Remark 4.6. The Amoeba Theorem of Gelfand, Kapranov, and Zelevinsky \GKZ9J\ Cor. 1.8] tells 
us that the white convex regions above correspond exactly to the vertices o/Newt(i?). The fact that 
all the facets of the convex hull of?{(3) are triangles implies that Newt(i?) is actually the image of 
the secondary polytope of7i(3) under an injective linear map \GKZ 94, Thm. 1.7 of pg. 221 and 
Thm. 1.4 of pg. 302]. Concretely, this means that the white convex regions also correspond exactly 
to the triangulations of7i(3), which we have drawn above as well^ o 



I Step (4): Now let T* be the complement of £1 U L in (]R*) 2 , where L is the set of vertical lines 
located at the real roots of the eliminant X(a) computed in Step (3). Clearly then, each connected 
component of T* is contained in a unique reduced W(3)-discriminant chamber. 

Returning to the positive quadrant, exactly one of the reduced chambers there (£3) possesses 
H(a,b,3) with 5 positive roots: all other chambers result in 4 or fewer positive roots. Verifying 
this reduces to solving a representative Hr a ^ 3 \ for each connected component of T*: there are 
125 such systems. In particular, after this sampling of representative points (a,b), we obtain that 
-B3 is contained in the union of three adjacent vertical strips, with extreme end-points located 
approximately (to 10 decimal places) at {1.4176759490, 1.4195167977}. We also obtain that within 
each strip, £3 is the region between two smooth curves. This is illustrated below. 




Figure 4.7: The (a,b) yielding Hub,S) with 5 isolated roots in the positive quadrant lie in the 
shaded region E3. The vertices — intersected by the four vertical lines — are a 
subset of the critical points of the projection mapping V-^) onto the a-axis. 

The boundary of E3 thus appears to be a subset of the union of 4 convex arcs. More rigorously, it 
suffices to show that each of the 7 smooth sub-arcs above, obtained from the parametric formula 
of Example 12. 161 is convex away from its cusps and nodes. This follows easily from computing the 
partial derivatives of the logarithms of our parametric formula and checking signs. So E3 is indeed 
non-empty, star-convex, and in fact has 4 vertices determined by the polynomials mentioned in the 
statement of Theorem 11.21 

To conclude, we need only verify that (44/31,44/31) G-E3. Instead of doing this via symbolic 
algebra, let us instead employ Smale's Alpha Theory, as summarized earlier: Clearly, we need 
only check partial derivatives of the hi up to order 6, and a quick computation reveals that any 
z= (zi, Z2) chosen from one of the five following points satisfies 0(^(44/31,44/31,3), z) <0.03: 

( 0.584513273807 , 0.818672114695 ) , ( 0.721441819886 , 0.757201442567 ), 
(0.740238978217, 0.740238978217) , (0.757201442567, 0.721441819886) , (0.818672114695, 0.584513273807) . 



Truthfully, we drew the mixed subdivisions of the pair (Supp(/ii), Supp(/i2)), where H^^^s = (hi, h^). The 
latter diagrams can be interpreted as projections of the triangulations of 7i(3). 
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Thus, each of these points is an approximate root of -££(44/31,44/31,3)- Also, since the roots of 
-££(44/31,44/31,3) are clearly symmetric about the line {x = y}, we need only compute the a-invariant 
3 times. So, thanks to Theorem 14.41 (and Proposition 15.11 of the Appendix), we are done. I 

Remark 4.8. Alternatively, we could have simply used any Grobner basis solver to get a rational 
univariate reduction for -££(44/31,44/31,3)- One could then use Sturm-Habicht sequences [Stu35, Hab48, 
Roy96, LM01] to find (certifiably correct) isolating intervals for the real roots. However, this naive 
Grobnerian approach becomes infeasible for higher degree examples (cf. Note Added in Proof), o 



5 The Proof of Theorem [L2] 



Here, we need only continue the development of the proof of Theorem 1 1 . 1 1 one step further: Since we 
already observed and proved the structure of the boundary of £3 in Section [H we need only verify 
that Ei and £2 are empty, and make two estimates concerning the size of £3. The emptiness of 
E2 follows from essentially the same techniques as we used for £3, but the resulting computations 
(which we omit) are much simpler. The emptiness of £1 follows directly from Bezout's Theorem: 



(a,6,l) 



being a pair of bivariate quadratic polynomials — has no more than 4 non-degenerate 



isolated complex roots. Also, since we already know from Proposition 11.41 that £3 is open, and 
our proof of Theorem 11.11 already showed £3 to be non-empty, we clearly have Area(£s) >0. 

To conclude, since the boundary curves of £3 are concave, the area of £3 is clearly bounded 
from above by the area of the convex hull of its vertices. Maple easily yields the estimate stated 
in our theorem. As for the estimate on the probability, we need only observe that the probability 
is in turn bounded above by the aforementioned area, times the value of the probability density 
function at the lower left vertex of £3. (The lower left vertex is clearly the point of £"3 maximizing 



any function of the form ae 



when a,(3>0.) Via Maple once again, we are done. 



Note that our proof here in essence focussed on one reduced ,4-discriminant chamber, for A = 

-6 -6 -6 -3 -5] 

031600, while the proof in the last section described a collection of sets refining the 
00 1 1 1 J 

true reduced ^.-discriminant chambers. With the calculations we have already done, we can in 
fact describe just the relevant ^.-discriminant chambers, and give the number of real roots for all 
non-degenerate H^ a h ^ . In particular, the intersection of M 2 n V_a with any sufficiently large square 
is, up to diffeotopy, the curve (possessing 3 isolated points) drawn below: 
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Moreover, we see that the number of relevant ^.-discriminant chambers is just 15 and, as detailed 
in the table above, there are just 13 possibilities for the quadruple consisting of the number of roots 
H(a,b,3) i n each real quadrant. A representative set of (a, b) also appears in the table above. 

Appendix: Additional Computational Details 



• The polynomial A(a), mentioned in Theorem 11.11 earlier, is the following univariate degree 36 
polynomial with large integer coefficients: 
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c 

Hi 



We point out that there are 254 digits per line, the largest coefficient of A has 1460 digits, and A 
is irreducible in Z[a] (verified via Maple). 

• A bit later, in the proof of Theorem II A\ we used the critical points method to study the com- 
plement of a reduced discriminant variety in (R*) 2 . The locations (to 10 decimal places), and 
dispositions, of the underlying vertical lines Li and critical values are tabulated below: 



-1.8562718399 


isolated point (on line x = y), root of C* 


-1.1581041767 


isolated point (y coordinate equal to x coordinate of next isolated point), root of A* 





the b- axis 


1.24487176148 


a-axis intersection 


1.41544129863 


cusp (appearing in Figure 4.7), root of B* 


1.41666026637 


cusp (appearing in Figure 4.7), root of B* 


1.41767594900 


node (appearing in Figure 4.7), root of C* 


1.41790510558 


node (appearing in Figure 4.7), root of A* 


1.41821476967 


node (appearing in Figure 4.7), root of A* 


1.41951679775 


cusp (appearing in Figure 4.7), root of D* 


1.43683087662 


node, root of A* 


1.47813022442 


node, root of A* 


1.48488178680 


node, root of C* 


1.59316011321 


node, root of A* 


1.60149022139 


node, root of A* 


1.92733319557 


isolated point, (y coordinate equal to x coordinate of preceding isolated point), root of A* 


2.45494131563 


node, root of A* 


2.47089273858 


node, root of A* 



• Finding the the aforementioned critical values took just over 30 minutes, using univariate 
resultants within Maple 10, on Rojas' dual 2.2Ghz Opteron Linux workstation with 4Gb 
memory. In particular, the locations were certified via use of the realroot command of Maple, 
which uses Sturm-Habicht Sequences |Stu351 IHab48[ |Roy96 , ILM01] to find intervals (with arbitrar- 
ily small size and rational endpoints) containing exactly the real algebraic numbers needed. 

• Finding the location of the critical points, and thus certifying the isolated points, took close 
to 5 days. For the latter computation, we used the gsolve command, and took advantage of the 
fact that the underlying reduced discriminant is a composition of a low degree polynomial with low 
degree monomials (cf. Remark 14. 5p . 

• The aforementioned refinement, T* , of the reduced ^.-discriminant complement turns out to have 
exactly 125 connected components. This was derived by picking rational numbers a% interlacing the 
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locations of the vertical lines Li, and then computing isolating intervals for the real roots of R(a,i, b). 
Finding representative points within each such component then simply amounted to picking rational 

(i) 

numbers bj interlacing these roots. In other words, the representative points are of the form 

(cii,bj ), and each such point was fed into a gsolve computation to find a rational univariate 

reduction (RUR) for H / 1$ \ . Each such RUR was then fed into Maple ' s realroot command, 

[ai,bj ,3J 

and in this way we found the number of real roots of every -ff( a ,ft,3) in the reduced discriminant 
complement. One can definitely see in hindsight — after the proof of Theorem 11.21 in Section 0— 
that there was redundancy in our particular family of chosen H^ a b ^ . 

• The approximations to the 5 isolated roots of -#(44/31,44/31,3) in the positive quadrant were com- 
puted by Jan Verschelde (via his software package PHC-pack [Ver06| ) during the October 2005 
Midwest Algebraic Geometry Conference at Notre Dame University. The computation took a 
fraction of a second on a standard lap-top computer. 

• For any square matrix M, let a{M) denote its largest singular value, i.e., the positive square root 
of the largest eigenvalue of M T M. The computation of the a-invariant for the Haas family can be 
greatly simplified by the following observation, which follows from a routine calculation by hand: 



Proposition 5.1. Let z = (zi,z 2 )eC 2 , M_i(z):= 



sen M k (z): 



6 ,5 f^ 2 -r- 

132^2 _ q fi 5 
31 z l 1 D2 2 



and for any k £ {2, ... ,6} 





44 (3 










0<" 


(!) 



Then a[H( 44 44 3 yzJ < .03 is implied by the truth of 

-1 



a{M^{z)M k {z))<( m forallke{2,...,6}. U 

\ | -M-H 2 )-" (44/31,44/31, 3) W| I 

Note that the right-hand norm is a vector norm. The above simplification allows one to apply 
Theorem 14.41 using just rational operations, after some minor observations involving characteristic 
polynomials of 2 x 2 matrices. Using Maple ' s arbitrary precision arithmetic, one can then easily 
check that just 6 digits of accuracy (for the 5 putative approximate roots preceding Remark 14. 8p 
suffices to yield a values below the critical threshold of 0.03. 

• A skeptical reader may certainly doubt the correctness of the underlying implementations of 
Grobner bases, Sturm-Habicht sequences, and arbitrary precision arithmetic within Maple — but 
hopefully not all at once. This was one motivation behind our use of Alpha Theory. 

• The locations of the vertical lines L\, . . . , Lis — which were central in our proof of Theorems ll.il 
and 11.21 — were independently verified by Bernard Mourrain (via the INRIA Sophia- Antipolis 
software package synaps) and Fabrice Roullier (via the INRIA Rocquencourt software package 
salsa), shortly after the September 2006 IMA Tutorial on Algebraic Geometric Methods in Engi- 
neering. 
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20 



valuable suggestions that clarified and improved this paper. We would also like to thank IMA (the 
Institute for Mathematics and its Applications, at the University of Minnesotta, Minneapolis), for 
their hospitality during the completion of this work. The second author is also grateful for the 
support of M. Danny Rintoul III during the early stages of this work, while he was visiting the 
Computational Biology Group at Sandia National Laboratories. 

Finally, we would like to wish Professor Khovanskii a very happy 60— birthday. We have long 
admired his work and his tremendous talent for turning even the most difficult ideas into simple 
and beautiful explanations. 

Note Added in Proof: In July 2006, Andrew Niles, an NSF sponsored undergraduate student 
(DMS-0552610) in the second author's 2006 REU class at Texas A&M University, found a 2 x 2 
real polynomial system — consisting of a degree 6 trinomial and a degree 141 tetranomial — with 
exactly 7 isolated roots in M.+ . 

Also, around the same time Martin Avendano, a Ph.D. student at the University of Buenos 
Aires, found a new upper bound for the number of real intersections of a line and an m-nomial 
IKriOty . By [LRW03[ Sec. 3, Lemma 1], Avendano' s result immediately implies that 2x2 real 
fewnomial systems of type (3, m) never have more than 6m — 1 isolated roots in The latter 

bound is a serious improvement over the best previous bound of 2 m — 2 \LRW03i Thm. 1 (a)], o 
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